End-to-End Fusion Network of Deep and Hand-Crafted Features for Small Object Detection
نویسندگان
چکیده
Recent advances in deep learning have enabled state-of-the-art performance detecting medium and large-size objects. However, small object detection remains challenging primarily due to the scarcity of information. This paper proposes an end-to-end fusion network that integrates hand-crafted features address this limitation. A module based on semantic context information is designed enhance feature discrimination ability. Additionally, we introduce a kind feature-contrast loss incorporate prior knowledge into according contrastive learning. Experiments MS COCO (34.4% ${\mathrm {A}}{{\mathrm {P}}_{\mathrm {S}}}$ ) PASCAL VOC (85.9% mAP) datasets demonstrate our approach achieves improved accuracy over previous methods, especially for
منابع مشابه
Person re-identification with fusion of hand-crafted and deep pose-based body region features
Person re-identification (re-ID) aims to accurately retrieve a person from a large-scale database of images captured across multiple cameras. Existing works learn deep representations using a large training subset of unique persons. However, identifying unseen persons is critical for a good re-ID algorithm. Moreover, the misalignment between person crops to detection errors or pose variations l...
متن کاملAffordanceNet: An End-to-End Deep Learning Approach for Object Affordance Detection
We propose AffordanceNet, a new deep learning approach to simultaneously detect multiple objects and their affordances from RGB images. Our AffordanceNet has two branches: an object detection branch to localize and classify the object, and an affordance detection branch to assign each pixel in the object to its most probable affordance label. The proposed framework employs three key components ...
متن کاملEnd-to-End Deep Neural Network for Automatic Speech Recognition
We investigate the efficacy of deep neural networks on speech recognition. Specifically, we implement an end-to-end deep learning system that utilizes mel-filter bank features to directly output to spoken phonemes without the need of a traditional Hidden Markov Model for decoding. The system will comprise of two variants of neural networks for phoneme recognition. In particular, we utilize conv...
متن کاملBaby Cry Sound Detection: A Comparison of Hand Crafted Features and Deep Learning Approach
Baby cry sound detection allows parents to be automatically alerted when their baby is crying. Current solutions in home environment ask for a client-server architecture where an end-node device streams the audio to a centralized server in charge of the detection. Even providing the best performances, these solutions raise power consumption and privacy issues. For these reasons, interest has re...
متن کاملAutomatic Lung Cancer Detection and Diagnosis Using Hand Crafted and Deep Learning Features
This paper presents a lung nodule detection and classification system which utilizes a combination of hand crafted and deep learning features. Hand crafted features were obtained from modified methods of bag of frequencies, and taxonomic indices. We included a robust radius estimation algorithm that resulted in an average error of 1.29 pixels. Hand crafted features were obtained from 3D low dos...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2023
ISSN: ['2169-3536']
DOI: https://doi.org/10.1109/access.2023.3283439